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site for user-specified data in a remote database 
at a second network site and conducts the search 
at a third network site (e.g., at a host computer's 
site). To begin, the system receives, at the first 
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the user-specified data is input at the first network 
site, following which the user-specified data and the 
provider identifier are output from the first network 
site to the third network site. The system then 
searches for the user-specified data in a database at 
the third network site using the provider identifier. 
This database at the third network site includes 
data that corresponds to data stored in the remote 
database at the second network site. 
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SEARCH AND INDEX HOSTING SYSTEM 

5 

BACKGROUND OF THE INVENTION 

Field Of The Invenrinn 

The present invention relates to a system for searching a 

10 remote database, and more particularly to a system which indexes 

documents in the database, which identifies documents in the index that 
include user-specified data, and which outputs a list of documents that 
contain such data and, optionally, excerpts from those documents. The 
invention has particular utility in connection with text indexing and retrieval 

15 systems, such as World Wide Web search engines. 

Description Of The Related Art 

In general, search engines search through a database for 
specific data and retrieve titles of documents in the database which contain 

20 that data. For example. World Wide Web search engines, such as 

AltaVista™ and Yahoo!®, provide users with the ability to search the Web 
for documents containing user-specified words, phrases, or the like. 
However, conventional search engines, and Web search engines in 
particular, suffer from a drawback in that they do not allow a user to direct 

25 a search to a single database. 

More specifically, conventional Web search engines, such as 
those noted above, operate by generating an index for all sites on the Web, 
and then retrieving data from that index in response to user queries. Since 
these search engines generate an index for all sites on the Web, however, 

30 they are limited to searching the entire Web. This is disadvantageous, 

particularly for those users who only want to search specific sites. 



wo 00/08572 



PCT/US99/173S9 



10 



In response to the foregoing drawbacks in the art, software 
manufacturers have developed site-specific searching systems, such as 
Ultraseek™, which ostensibly allow users to limit their searches to specific 
Web sites. To use these systems, however, they must be installed at each 
Web site, e.g., by the Web site's provider. Once installed, the systems 
create an index of the Web site at the provider's location. Thereafter, 
when a user accesses the Web site and inputs a search query, code at the 
site searches that index for the query, and relays the results of the search 
back to the user. 

While the foregoing types of site-specific searching systems 
address some of the problems associated with conventional Web search 
engines, such as Yahoo!® and the like, they have several drawbacks. For 
example, their installation and subsequent maintenance can be costly and 
time consuming. As a result, Web site providers often choose not to install 
15 such systems at their sites. Additional problems arise with these systems in 

cases where a Web site is maintained by a Web site hosting company, as 
opposed to by the provider itself. That is, in these cases, in addition to the 
above problems, problems relating to licensing and the like arise, which 
make it difficult to implement conventional site-specific searching systems 
20 in a cost-effective manner. 

In addition to the foregoing drawbacks, conventional 
database searching systems provide the user with only names/titles of 
documents in response to a query. For example, conventional Web search 
engines provide only the name of a document containing a search term, 
25 together with a uniform resource locator ("URL") for that document. As a 

result, it is not always possible for the user to determine which of the 
retrieved documents is relevant without acmally linking to, and opening, 
the document. This can slow down searching significantly. 

Thus, there exists a need for a searching system which 
30 provides the user with the ability to search a portion of a database, such as 
one or more sites on the World Wide Web, and which is more cost 
effective, efficient, and easy to use than the conventional systems described 
above. In addition, there exists a need for a database searching system 
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which is able to provide a user with the context of each search term in 
documents retrieved as a result of the search. 

SUMMARY OF THE INVENTION 
5 The present invention addresses the foregoing needs by 

providing a way to search through a database at one network site (e.g., a 
Web site) using a host computer which is at another network site. By 
hosting the search at a separate site, the present invention facilitates site- 
specific searching, as described below. 

10 More specifically, according to the present invention, a Web 

site provider, for example, is able to create a search engine for the Web 
site simply by accessing the present invention via the Web and entering a 
request for a new account. In response to this request, the invention 
assigns the Web site a provider identifier, and then extracts URL(s) from 

15 the Web site. Thereafter, the invention "crawls" through the site in order 

to create an index of the site, which comprises data from the site indexed 
by document (e.g., Web page) and provider identifier. Once the indexing 
process has been completed, the site provider need simply copy a few 
(e.g., 10) lines of code into any sites for which searching capabilities are 

20 desired. 

Following the foregoing (i.e., the setup), each time the Web 
site is visited, it will automatically transmit its provider identifier to the 
visiting user's site. In addition, the Web site will display a search line, 
from which the visiting user may enter queries to search the site for 

25 specific data. When such a query is entered, the query, together with the 

provider identifier, is passed from the user's site to the host computer's 
site, where the actual searching takes place. Specifically, at the host 
computer's site, an index corresponding to the provider identifier is 
retrieved from memory and searched for the data specified in the user's 

30 query. Thereafter, a list of documents which contain the data (including 

URLs in the case of the Web) is output from the host computer's site to the 
user's site and displayed there. 

By conducting the search at the host computer's site, rather than 
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at the Web site itself, the present invention reduces the difficulties involved 
with installing and maintaining an entire software application at the Web site. 
As a result, the present invention provides a way to search specified Web sites 
(and other types of databases as well), which is more efficient and less costly 
and time consuming than the conventional she-specific searching systems 
described above. 

Thus according to one aspect, the present invention is a 
system (i.e., a method, an apparatus, and computer-executable process 
steps) for initiating a search at a first network site for user-specified data in 
a remote database at a second network site and for conducting the search at 
a third network site (e.g., at a host computer's site). To begin, the system 
receives, at the first network site, a provider identifier associated with the 
database from the second network site. Thereafter, the user-specified data 
is input at the first network site, following which the user-specified data 
and the provider identifier are output from the first network site to the thu-d 
network site. The system then searches for the user-specified data m a 
database at the third network site using die provider identifier. In the 
invention, this database at the third network site includes data that 
corresponds to data stored in the remote database at the second network 
20 site. 

According to another aspect, the present invention is a way 
to configure a computerized searching system (such as the searching system 
resident at the host computer's site described above) so that the searching 
system can be used to search a database. In this aspect of the invention, 

25 information identifying the database is input, a provider identifier is 

assigned to the database, and a search through the database is conducted 
using the input information in order to identify locations of documents in 
the database. Thereafter, the locations of the documents in the database are 
stored in memory together with the provider identifier, and the documents 

30 in the database are indexed. An index of the documents is then stored in 

memory together with the provider identifier; and data corresponding to 
data in the database is also stored in memory together with the provider 
identifier. The provider identifier is then output to the database. As noted 
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above, this provider identifier is transmitted to those who visit the site. 

According to still another aspect, the present invention is a 
system for identifying which documents in a database contain user-specified 
data. The system stores, in memory, indices of data in plural databases, 
such as those noted above. The system then receives the user-specified 
data and a provider identifier which corresponds to one of the plural 
databases, and retrieves, from memory, an index of data for a database that 
corresponds to the provider identified. Thereafter documents in the 
retrieved index that contain the user-specified data are identified, and 
identities thereof are output to the user. 

The present invention also provides an optional feature for 
displaying excerpts from documents identified by a database search. In this 
aspect of the invention, the index of documents in the database is stored, 
and pointers to data segments in the database are generated based-on the 
index. These data segments comprise target data together with data 
surrounding the target data. That is, assuming that the target data 
comprises a word which matches an input user query, the data segment for 
that word might comprise, e.g., five words to the left of the word, the 
word itself, and five words to the right of the word. When the invention 
searches the index for the word, it compiles a list of pomters to data 
segments which include the word. These data segments may then be 
extracted and passed to a user's site for display along with the list of 
documents. 

Thus, according to this aspect, the invention is a system for 
retrieving a list of documents in a database which include user-specified 
data, and of retrieving one or more data segments from each document on 
the list. The system includes storing an index of documents ft-om the 
database, the index including pointers corresponding to data in the 
database, where the pointers define data segments having a predetermined 
size. One or more documents in the database that contain the user- 
specified data are then identified based on the index; and a list is created 
which includes one or more pointers corresponding to each occurrence of 
the user-specified data in the identified documents. Each data segment in 
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the database that contains the user-specified data is extracted based on the 
list of pointers created in the creating step, whereafter a list of the 
documents in the database that contain the user-specified data is output, 
together with the extracted data segments. 

By virme of the foregoing, the invention makes it possible to 
display data excerpts (i.e., segments) from each documem found in the 
search. A user may then refer to these excerpts in order to determine 
whether each documem is relevant, instead of actually opening the 
document. As a result, the present invention facilitates database searching. 

This brief summary has been provided so that the nature of 
the invention may be understood quickly. A more complete understanding 
of the invention can be obtained by reference to the following detailed 
description of the preferred embodiment thereof in connection with the 
attached drawings. 

BRIEF DR SCRIPTION OF THE ORAWTNO^; 

Figure 1 is a perspective view of a host computer system, 
including a personal computer and a query server, which is used in the 
present invention. 

Figure 2 shows the architecture of the personal computer 
shown in Figure 1 . 

Figure 3 shows the architecture of the query server shown in 

Figure 1. 

Figure 4 is a flow diagram showing process steps for setting- 
up the search hosting system of the present invention on a Web site. 
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Figure 5 shows the software architecture used to implement 
the process steps shown in Figure 4. 

Figures 6A and 6B show a representative new account form 
used in setting-up the search hosting system of the present invention. 

Figure 7 shows an example of a Web site which does not 
include the search hosting system of the present invention. 

Figure 8 shows an example of a Web site which includes the 
search hosting system of the present invention. 

Figure 9 shows the software architecture of the search 
hosting system of the present invention. 

Figures I OA and lOB show process steps implemented by the 
software architecture of Figure 9 to perform a search of a Web site at a 
host computer system. 

Figure 1 1 shows operation of a concordance generator 
included in the software architecture of Figure 9. 

Figure 12 shows an example of a results page generated by 
the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
Figure 1 shows a representative embodiment of a host 
computer system which can be used to implement the present invention. 
As shown in Figure 1, host computer system 1 includes PC 2 and query 
server 4. PC 2 includes a local area network connection 5 for interfacing 
to query server 4, a general network connection 6 for interfacing to a 
network such as the Internet, and fax/modem connection 7 for interfacing 
with other remote sources. PC 2 also includes display screen 9 for 
displaying information, keyboard 10 for inputting text and user commands, 
mouse 11 for positioning a cursor on display screen 9 and for inputting 
user commands, disk drive 12 for reading from and writing to floppy disks 
installed therein, and CD-ROM drive 14 for accessing information stored 
on CD-ROM. PC 2 may also have one or more peripheral devices (not 
shown) attached thereto. 

Figure 2 shows the internal structure of PC 2, As shown in 
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Figure 2, PC 2 includes memory 15, which comprises one or more 
computer-readable media, such as a computer hard disk. Memory 15 
stores data 16, applications, and an operating system 17. Among the 
applications stored in memory 15 is results page generator 19, the function 
5 of which is described in detail below. Also included in PC 2 are display 

interface 20, keyboard interface 21, mouse interface 22, disk drive 
interface 24, CD-ROM drive interface 25, computer bus 26, RAM 27, and 
processor 29. Processor 29 preferably comprises a microprocessor or the 
like for executing applications out of RAM 27. As noted above, these 
10 applications may be stored in memory 15 or, alternatively, on a floppy disk 

in disk drive 12 or a CD-ROM in CD-ROM drive 14. In this regard, 
processor 29 accesses applications (or other data) stored on a floppy disk 
via disk drive interface 24 and accesses applications (or other data) stored 
on a CD-ROM via CD-ROM drive interface 25. 
15 Application execution and other tasks of PC 2 and query 

server 4 may be controlled and/or altered using keyboard 10 or mouse 11, 
commands from which are transmitted to processor 29 via keyboard 
interface 21 and mouse interface 22, respectively. Output results from 
some applications running on PC 2 may be processed by display interface 
20 20 and then displayed to a user on display 9. To this end, display interface 

20 preferably comprises a display processor for forming images based on 
data provided by processor 29 over computer bus 26, and for outputting 
, those images to display 9. Other applications, such as results page 
generator 19 receive their inputs from, and/or provide outputs to, network 
25 interfaces 5 and 6. 

Figure 3 shows a representative architecture for query server 
4. As shown in Figure 3, query server 4 includes processor 30, RAM 31, 
computer bus 32, and memory 34. Memory 34 preferably comprises one 
or more computer-readable media, such as a computer hard disk. Memory 
30 34 stores data and applications, with the applications including Web robot 

35, Web search engine 36, indexer module 37, and search and index 
hosting application 39, among others. Search and index hosting application 
39 comprises computer-executable process steps to generate and execute a 

-8- 
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database-specific search engine A detailed description of these process 
steps is provided below. 

In preferred embodiments of the invention, search engine 36 
and indexer module 37 are based upon the search engine and indexer 
5 described in "The SMART Retrieval System: Experiments in Automatic 

Document Processing" by Gerald Salton (Prentice-Hall, Inc. (1971)) and 
"A Theory of Indexing" also by Gerald Salton (J. W. Arrowsmith, Ltd. 
(1975)). The contents of these two documents are hereby incorporated by 
reference into the subject application as if set forth herein in full. Web 
10 robot 35 can comprise any conmiercially available Web robot. 

Applications such as search and index hosting application 39 
are executed by processor 30 out of RAM 31, and results therefrom are 
output to host computer 2 via network connection 5. In this regard, search 
and index hosting application 39 comprises a database-specific search 
15 engine, in that it identifies which documents in a database contain a user- 

specified query term. The application then retrieves a list of such 
documents and, optionally, one or more data segments from each document 
on the list. In the preferred embodiment of the present invention, the 
database searched by application 39 is a Web site and the documents 
20 comprise one or more Web pages. However, it is noted that the invention 
is not limited to searching Web sites and, in fact, can be used to search a 
variety of different types of databases. Moreover, it is noted that although 
the invention will be described with respect to retrieving documents from 
Web sites based on text/character strings, the invention is not limited to 
this either. That is, the invention may also be used to retrieve images or 
other figures from any type of database. 



25 
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Prior to performing site-specific searching using the present 
invention, it is necessary for a Web site provider to set up an account. 
Figures 4 and 5 show how this is done. More specifically, as shown in 
Figure 5, a Web site provider's representative, such as their system 
administrator, issues request 40, over network 41 (e.g., the Internet), to a 
host computer system running search and index hosting application 39. In 
response, search and index hosting application 39 retrieves Web page forms 
required to set up a new account from storage 42, and transmits these 
forms back to the provider's representative, where the forms are displayed 
on die representative's computer. As shown in Figure 5, displayed form 
44 requests "contact" information (e.g., information regarding the 
provider's Web site) and entry point URLs for the site. This contact 
information may be input in step S401 of Figure 4. 

Figures 6A and 6B show a representative example of a new 
account form generated by a preferred embodiment of the present 
invention, which will be sold under the tradename PicoSearch™. As shown 
in these figures, new account form 45 requests information regarding the 
provider's representative, such as their name. E-mail address, and account 
password, together with three entry point URL's to the provider's Web 
sue. Of course, the invention is not limited to using three entry point 
URLs, and even can be used without entering the contact information 
shown in Figure 6A. As shown in Figure 6B. the preferred embodiment of 
the invention also provides a way in which to control searching throughout 
a specified site. For example, in this embodiment, it is possible to restrict 
25 searching to the directory level, the server level, or the domain level. Of 

course, the invention is not limited to the restrictions shown in Figure 6B, 
and can be modified to include other searching options as well. 

Once the contact information and entry point URLs have 
been added to the new account form, the "fiUed-out" form is transmitted 
30 over the network to the host computer system. There, contact information 
46 and entry point URLs 48 are extracted from the new account form, as 
shown in Figure 5. The contact information (i.e., the provider 
representative's name, E-mail address, etc.) is then stored in user database 



20 
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49, which may reside on memory 34 or any other memory on the host 
computer system. Search and index hosting application 39 then assigns the 
provider representative a new provider identifier 50 in step S402. This 
provider identifier is then combined with entry point URLs 48, the 
combination of which is provided to Web robot 35. At about 

the same time or shortly thereafter, Web site provider 52 provides data 
from its Web site (e.g., plural Web documents/pages) to host computer 
system 1 via network 41. In step S403, Web robot 35 searches through the 
data from the Web site for locations (i.e., URLs) and titles of documents 
therein, starting at the entry point URLs specified in the new account form. 
Next, in step S404, the location and title of each Web document that is 
found in the search is stored in user database 49, together with the provider 
identifier and a predetermined document identifier for each document. The 
use of these document identifiers is described below. After Web robot 35 
15 has completed its "crawl" of the Web site, Web robot 35 passes provider 

identifier 50 for the Web site to indexer module 37. 

In step S405, indexer module 37 indexes data stored in 
documents on the Web site. More specifically, indexer module 37 reviews 
each of Web documents 55 identified by Web robot 35, and creates an 
20 index of pointers to data in the Web site, where the data is indexed 

according to document, location in the document, and provider identifier. 
This index, together with the provider identifier and predetermined 
document identifiers, is then stored in multiple index storage 56 in step 
S406. In preferred embodiments of the invention, multiple index storage 
56 resides in memory 34 of query server 4; although the invention is not 
limited to storing multiple index storage 56 there. 

In addition to indexing data in the manner described above, 
in step S407. indexer module 37 compresses data from the Web site and 
stores that data in compressed document ftill text storage 57, together with 
the provider identifier for the site. As was the case above, storage 57 can 
reside in memory 34; although the invention is not limited to this. Next, in 
step S408, indexer 37 outputs processing results 59, including the provider 
identifier, to the provider's representative, as shown in Figure 5. At this 
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point, it is noted that althougli Figure 5 sliows the provider's representative 
at a location which is different from that of the Web site provider, this 
need not be the case. That is, the provider's representative and the 
provider may also be at the same location. 

Once an account has been established for the Web site, the 
provider's representative should copy code provided by the host computer 
system into their Web site. A representative example of such code is 
shown.in the Appendix attached hereto. Among other things, this code 
permits the Web site to provide an interface to search and index hosting 
application 39 on the host computer system. As an illustration. Figure 7 
shows an example of a Web site 60 which does not include the site-specific 
searching capabilities of the present invention, while Figure 8 shows that 
same Web site with the site-specific searching capabilities. As shown in 
Figure 8, the site with the searching capabilities includes entry line 61 and 
search button 62. To search for a text or the like at Web site 60, all that 
must be done is to input the text on the entry line, and click on the search 
button The searching process of the present invention is described below 
with respect to the remaining figures. 

First, however, it is noted that although the invention has 
been described with respect to only one Web site provider, the invention is 
not limited to use with only one provider. In fact, the invention is able to 
handle searches of any number of Web sites. That is, any number of Web 
site providers' representatives may set up accounts with the host computer 
system. As a result, the host computer system may store indices of data 
from plural databases (e.g., plural Web sites), locations of documents in 
those plural databases, and compressed text from the plural databases. 
However, because the invention also stores with each of these a provider 
identifier, the invention is able to select the appropriate data readily, and 
thus to perform site-specific searches. This feature of the invention is 
described more fully below. 

In brief, as shown Figure 9, a user initiates a search at first 
network site 64 (e.g., the user's computer) for data in a remote database at 
second network site 65 (e.g., an Internet site or Web site server) and the 

-12- 
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invention conducts the search at third network site 66 (e.g.. the host 
computer system). At this point, it is noted that although Figure 9 shows 
the three network sites 64, 65 and 66 at three different locations (i.e., at 
three computer systems) on the network, these network sites may reside at 
5 the same network location or, alternatively, at only two different network 

locations. In any event, in accordance with the invention, first network site 
64 receives a provider identifier associated with the database from second 
network site 65 and then outputs a query containing user-specified data and 
the provider identifier from first network site 64 to third network site 66. 
10 Third network site 66 then searches for the user-specified data in a database 

at the third network site using the provider identifier. This database at 
third network site 66 includes data that corresponds to data stored in the 
remote database at the second network site. The above process is described 
in more detail below. 

15 More specifically, as shown in Figure 9, a user at first 

network site 64 initiates contact to Web site 65 via a Web browser (not 
shown), such as Netscape®. In response, Web site 65 automatically 
provides the user's computer with its provider identifier 67, together with 
Web page(s) 68 from the site. An example of such a page is shown in 

20 Figure 8. Thereafter, the user may enter a query 71 comprised, e.g., of 

user-specified data such as character strings, text, or the like, into the entry 
line provided with the Web page. The query and the provider identifier are 
then packetized and packet 72 is passed, via network 69 (e.g., the 
Internet), to the host computer (i.e., PC 2). The host computer receives 

25 this packet in step SI GDI of Figure lOA, extracts the provider identifier and 

data from the query, and passes this information on to query server 4 via 
local network 70 (i.e., over local network connection 5 shown in Figure 1). 

Thereafter, in step S1002, index retriever 74 uses provider 
identifier 67 to retrieve, from multiple index storage 56, an index of data 

30 that has the same provider identifier as the Web site. This index with the 

same provider identifier, labeled 75 in Figure 9, is then passed to search 
engine 36. along with query 71. In step S1003, search engine 36 searches 
through index 75 for data specified in query 71 in order to identify 
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documents that contain the specified data. The search engine then retrieves 
identifiers 76 (e.g., numbers) for these documents from the index. 
Thereafter, processing proceeds to step S1004. 

Step S1004 determines whether data segments in each 
5 identified document should be output along with a list of identified 

documents. In a case that step SI 004 determines that no data segments 
should be output, processing proceeds to step S1005 of Figure lOB. As 
shown in Figure 9, this processing corresponds to dashed line 77, which 
bypasses concordance generator 78 (described below). Thus, in step 

10 S1005, search and index hosting application 39 extracts locations (e.g., 

URLs) and titles of documents from user database 49 that have identifiers 
which match the documents identified in step SI 003. 

The extracted document locations and titles 79 are then 
output, via local network 70, to results page generator 19, In step S1006, 

15 results page generator 19 generates a results page showing titles of, and 

URL links to, documents on the Web site which contain the data specified 
in the query. This results page 80 is then passed to the user's computer via 
network 69, where it is displayed. In preferred embodiments of the 
invention, this results page also includes an entry line, such as entry line 81 

20 shown in Figure 9, from which the user may initiate a new search of the 

Web site. If no new searches are to be performed, processing ends. 
Otherwise, the foregoing process is repeated for the new search. 
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Returning to Figure lOA, in a case that data segments, such 
as text excerpts or the like, are to be output along with the list of 
documents, processing proceeds to step S1007. In this regard, as noted 
above, the present invention includes the capability of also outputting, from 
5 each document on the list, a text or data segment which contains the user- 

specified data. These segments comprise target data (i.e., the user- 
specified data) together with data adjacent to the target data. That is, 
assuming that the target data comprises a word which matches the user- 
specified data, the data segment for that word might comprise, e.g., five 

10 words to the left of the word, the word itself, and five words to the right of 

the word. Of course, the invention is not limited to using eleven- word data 
segments, meaning that any size data segment may be used. In fact, in 
preferred embodiments, the size and/or locations of the data segments 
relative to the user-specified data are settable by the user, and thus can be 

15 varied as desired. 

Thus, following step S1004, in step S1007 concordance 
generator 78 searches the documents identified in step S1003 for locations 
of the user-specified data. Figure 11 is a close-up view of the operation of 
concordance generator 78. More specifically, as shown in Figure 11, in 

20 step SI 007 concordance generator 78 performs a boolean search dirough 
documents 76 for data that matches data specified in query 71. When 
matching data is found, step SI 008 generates pointers into the 
corresponding compressed data based on pointers to locations in the index. 
The generated pointers identify the user-specified data in the compressed 

25 data, together with a predetermined amount of additional adjacent data 

needed to make up a data segment. 

Roughly concurrently, in step S1009. concordance generator 
78 extracts the provider identifier for the current Web site fi-om index 75, 
and selects and retrieves compressed data 82 for that Web site ft-om fiill 

30 compressed text storage 57. Thereafter, in step SlOlO, concordance 

generator 78 decompresses selected data segments that correspond to the 
pointers generated in step SI 008. In preferred embodiments of the 
invention, only compressed data corresponding to the selected data 
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segments is decompressed. As noted in Figure 11, a data segment may 
start m characters before the query term and end n characters after the 
query term - only this segment would be decompressed. Thereafter, arrays 

84 of concordance strings (i.e., decompressed data segments) are output 
ft-om concordance generator 78, together with corresponding document 
identifiers specifying the document from which each string originated. 

Arrays 84 of concordance strings are then output from query 
server 4 to results page generator 19 via local network 70. Results page 
generator 19 uses the decompressed data segments to formulate its results 
page. In addition, results page generator 19 extracts document identifiers 

85 corresponding to the decompressed data segments, and passes these 
document identifiers back to query server 4. In step SlOll, query server 4 
extracts locations (e.g.. URLs) and titles of documents from user database 
49 that match the document identifiers, and passes this information back to 
results page generator 19. Results page generator 19 formulates a results 
page which includes document titles, data segments, URLs, and the original 
query term, and passes the results page, together with the provider 
identifier, back to die user in step S1012. The results page is then 
displayed on the user's computer. As was the case above, the results page 
may include an entry line, from which the user may conduct another search 
of the Web site specified by the provider identifier. If no additional search 
is to be conducted, processing ends. Otherwise, the foregoing process is 
repeated for the new search. 

Figure 12 is an example of a results page generated by the 
preferred embodiment of the present invention. Specifically, this results 
page was generated from a search of the Web site shown in Figure 8, in 
which the user-specified character string was "labor and employment". As 
shown in Figure 12, the results page includes document titles 86a, 86b and 
86c, and corresponding URLs 87a, 87b and 87c for each document title. 
In addition, the results page shown in Figure 12 includes excerpts 88a, 88b 
and 88c from each document, which include the user-specified character 
string. As noted above, however, this feamre of the invention is optional, 
and the results page need not include these excerpts. As also shown in 
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Figure 12, the results page includes entry line 89, whereby the user may 
conduct a new search of the Web site shown in Figure 8 for a new search 
term. 

The present invention has been described with respect to a 
panicular illustrative embodiment. It is to be understood that the invention 
is not limited to the above-described embodiment and modifications thereto, 
and that various changes and modifications may be made by those of 
ordinary skill in the an without depaning from the spirit and scope of the 
appended claims. 
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APPENDIX 

<<!-- Begin PicoSearch Code --> 

< <P> < <FORM METHOD = "POST" 

5 ACTION = "http://www.picosearch.com/cgi-bin/is.pl" > 

< <INPUT TYPE="HIDDEN" NAME = "inciex" VALUE = "236"> 

< < TABLE BGCOLOR = "WHITE" CELLSPACING =0 
CELLPADDING = 0 BORDER =0 ><<TR><<TD> 

< < TABLE BGCOLOR = "WHITE" CELLSPACING =2 
10 CELLPADDING =0 BORDER =0> 

-<<TR><<TD><<A HREF= "http://www.picosearch.com"> 

< <IMG BORDER="0" 

SRC = " http:// www . picosearch . com/picosmall . gif ' 
ALT = " PicoSearch " >< < / A >< < /TD > 
15 < < TD >< < INPUT TYPE = "TEXT" NAME = "query " VALUE = " " 

SIZE="20"> < </TD> 

< < TD >< < INPUT TYPE= "SUBMIT" 
VALUE = "Search" > < </TD> < </TR> 

< </TD> < </TR> < </TABLE> < </TABLE> < </FORM> 
20 < < !- End PicoSearch Code ~> 
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WHAT IS CLAIMED IS : 

1 . A method of identifying which documents in a database 
contain user-specified data, the method comprising the steps of: 

storing, in memory, indices of data in plural databases, the 
data being indexed by document, location in the document, and a provider 
identifier; 

receiving the user-specified data and a provider identifier 
which corresponds to one of the plural databases; 

retrieving, from memory, an index of data for a database 
that corresponds to the provider identifier; and 

identiiying documents in the retrieved index that contain the 
user-specified data. 



15 



2. A method according to Claim 1, further comprising the 
step of outputting identities of the documents identified in the identifying 
step. 



3. A method according to Claim 1, wherein the database is 
20 located on the Internet. 



4. A method according to Claim 1, further comprising the steps 

of: 

searching for locations of the user-specified data in the 
25 documents identified in the identifying step; 

storing, in memory, compressed data which corresponds to 
data stored in each of the plural databases; 

generating pointers into the compressed data based on the 
search conducted in the searching step; 

selecting compressed data from memory that corresponds to 
the provider identifier; 
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decompressing segments of the compressed data selected in 
the selecting step that correspond to the pointers generated in the generating 
step; and 

outputting the decompressed segments. 

5 

5. A method according to Claim 4, wherein the pointers 
identify the user-specified data in the compressed data, together with a 
predetermined amount of adjacent data. 

6. A method according to Claim 5, wherein the amount 
and/or location of the adjacent data relative to the user-specified data are 
variable, 

7. A method according to Claim 4. wherein each segment of 
decompressed data includes a' document identifier which identifies a 
document containing the segment; and 

wherein the method further comprises the steps of: 
storing, in memory, locations of documents in the plural 
databases, together with corresponding document identifiers; 

extracting, from memory, locations of documents which 
contain the segments of decompressed data based on the document 
identifiers associated with the segments; and 

outputting the locations of the documents which contain the 
segments of decompressed data. 

8. A method according to Claim 6, wherein the locations of 
the documents comprise uniform resource locators, 

9. A method of retrieving a list of documents in a database 
which include user-specified data, and of retrieving one or more data 
segments from each document on the list, the method comprising the steps 
of: 

storing an index of documents from the database, the index 
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including pointers corresponding to data in the database, where the pointers 
define data segments having a predetermined size; 

identifying one or more documents in the database that 
contain the user- specified data based on the index; 
5 creating a list of one or more pointers corresponding to each 

occurrence of the user-specified data in the documents identified in the 
identifying step; 

extracting each data segment in the database that contains the 
user-specified data based on the list of pointers created in the creating step; 
10 and 

outputting a list of the documents in the database that contain 
the user-specified data, together with the extracted data segments. 



10. A method according to Claim 9, wherein the database is 
15 located on the Internet. 

11. A method according to Claim 10, wherein the list of 
documents comprises one or more uniform resource locators and/or 
uniform resource indicators. 



20 



25 



12. A method according to Claim 9, wherein the identifying 
step comprises performing a boolean search of the index for documents 
which contain the user-specified data. 



13. A method according to Claim 9, further comprising the 
step of storing, in memory, compressed data which corresponds to the data 
stored in the database; and 

wherein the extracting step comprises extracting each data 
segment from the compressed data and decompressing each extracted data 
30 segment prior to output in the outputting step. 

14. A method according to Claim 13, wherein each data 
segment includes a document identifier which identifies a document that 
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includes the data segment; 

wherein the method further comprises storing, in memory, 
locations of documents in the database, together with associated document 
identifiers for the documents; and 
5 wherein the outputting step generates the list of documents 

by comparing document identifiers from extracted data segments with 
document identifiers associated with locations of documents, extracting 
locations of documents corresponding to the extracted data segments based 
on the document identifiers, and including locations of the documents in the 
10 list of documents. 



15. A method of initiating a search at a first network site for 
user-specified data in a remote database at a second network site and of 
conducting the search at a third network site, the method comprising the step 
15 of: 

receiving, at the first network site, a provider identifier 
associated with the database from the second network site; 

inputting the user-specified data at the first network site; 

outputting the user-specified data and the provider identifier 
20 from the first network site to the third network site; and 

searching for the user-specified data in a database at the third 
network site using the provider identifier, where the database at the third 
network site includes data that corresponds to data stored in the remote 
database at the second network site. 

25 

16. A method according to Claim 15, wherein the third 
network site includes a memory which stores indices of data in plural 
databases, the plural databases including the remote database at the second 
network site, and the data in each index being indexed by document, 
30 location in each database, and provider identifier; and . 

wherein the searching step comprises: 

retrieving, from the memory, an index of data for a database 
that corresponds to the provider identifier; and 
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identifying documents in the index that contain the user- 
specified data. 

17. A method according to Claim 16, wherein the database 
5 at the third network site is published on the Internet. 

18. A method of configuring a computerized searching 
system so that the searching system can be used to search a database 
comprised of one or more documents, the method comprising the steps of: 

10 inputting information identifying the database; 

assigning a provider identifier to the database; 

searching through the database using the information input in 
the inputting step in order to identify locations of documents in the 
database; 

15 storing, in memory, the locations of the documents in the 

database, together with the provider identifier; 

indexing the documents in the database; 
storing, in memory, an index of the documents generated in 
the indexing step, together with the provider identifier; 
20 storing, in memory, data corresponding to data in the 

database, together with the provider identifier; and 

outputting the provider identifier to the database. 
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19. A method according to Claim 18, further comprising the 

steps of: 

receiving user-specified data and the provider identifier for 

the database; 

retrieving, from memory, the index of documents for the 
database based on the provider identifier; and 

identifying documents in the index that contain the user- 
specified data. 



20. A method according to Claim 19, wherein the data 
corresponding to data in the database is stored in memory in compressed 
format; and 

wherein the method funher comprises the steps of: 

searching for the" user-specified data in the documents 
identified in the identifying step; 

generating pointers into the compressed data based on the 
search conducted in the searching step; 

selecting compressed data from memory that corresponds to 
the provider identifier; 

decompressing segments of the compressed data selected in 
the selecting step that correspond to the pointers generated in the generating 
step; and 

outputting the decompressed segments of compressed data, 
together with a list of documents containing the decompressed segments. 

21. An apparatus for identifying which documents in a 
database contain user-specified data, the apparatus comprising: 

memory means which stores computer-executable process 
steps and indices of data in plural databases, where the data is indexed by 
document, location in the document, and provider identifier; and 

a processor which executes the process steps so as (i) to 
receive the user-specified data and a provider identifier which corresponds 
to one of the plural databases, (ii) to retrieve, from memory, an index of 
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data for a database that corresponds to the provider identifier, and (iii) to 
identify documents in the retrieved index that contain the user-specified 
data. 

5 22. An apparatus according to Claim 21, wherein the 

processor funher executes process steps so as to output identities of the 
documents identified in the identifying step. 

23. An apparatus according to Claim 21, wherein the 
10 database is located on the Internet. 

24. An apparatus according to Claim 21, wherein the 
processor further executes process steps so as (i) to search for locations of 
the user-specified data in the documents identified in the identifying step, 

15 (ii) to store, in memory, compressed data which corresponds to data stored 

in each of the plural databases, (iii) to generate pointers into the 
compressed data based on the search conducted in the searching step, (iv) 
to select compressed data from memory that corresponds to the provider 
identifier, (v) to decompress segments of the compressed data selected in 

10 the selecting step that correspond to the pointers generated in the generating 

step, and (vi) to output the decompressed segments. 

25. An apparatus according to Claim 24, wherein the 
pointers identify the user-specified data in the compressed data, together 

15 with a predetermined amount of data adjacent to the user-specified data. 
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26. An apparatus according to Claim 25, wherein the 
amount and/or location of the adjacent data relative to the user-specified 
data are variable. 

27. An apparatus according to Claim 24, wherein each 
segment of decompressed data includes a document identifier which 
identifies a document containing the segment; and 

wherein the processor further executes process steps so as (i) 
to store, in memory, locations of documents in the plural databases, 
together with corresponding document identifiers, (ii) to extract, from 
memory, locations of documents which contain the segments of 
decompressed data based on the document identifiers associated with the 
segments, and (iii) to output the locations of the documents which contain 
the segments of decompressed data. 

28. An apparatus according to Claim 26, wherein the 
locations of the documents comprise uniform resource locators. 

29. An apparatus for retrieving a list of documents in a 
database which include user-specified data, and for retrieving one or more 
data segments from each document on the list, the apparatus comprising: 

memory means which stores computer-executable process 
steps and an index of documents from the database, the index including 
pointers corresponding to data in the database, where the pointers define 
data segments having a predetermined size; and 

a processor which executes the process steps so as (i) to 
identify one or more documents in the database that contain the user- 
specified data based on the index, (ii) to create a list of one or more 
pointers corresponding to each occurrence of the user-specified data in the 
documents identified in the identifying step, (iii) to extract each data 
segment in the database that contains the user-specified data based on the 
list of pointers created in the creating step, and (iv) to output a list of the 
documents in the database that contain the user-specified data, together with 
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the extracted data segments. 

30. An apparatus according to Claim 29. wherein the 
database is located on the Internet. 

31. An apparatus according to Claim 30, wherein the hst of 
documents comprises one or more uniform resource locators and/or 
uniform resource indicators. 

32. An apparatus according to Claim 29, wherein the 
identifying step executed by the processor comprises performing a boolean 
search of the index for documents which contain the user-specified data. 

33. An apparams according to Claim 29, wherein the 
processor further executes process steps so as to store, in memory, 
compressed data which corresponds to data stored in the database; and 

wherein, in the extracting step, the processor extracts each 
data segment from the compressed data and decompresses each extracted 
data segment prior to output. 

34. An apparatus according to Claim 33, wherein each data 
segment includes a document identifier which identifies a document that 
includes the data segment; 

wherein the processor further executes process steps so as to 
store, in memory, locations of documents in the database, together with 
associated document identifiers for the documents; and 

wherein, in the outputting step, the processor generates the 
list of documents by comparing document identifiers from extracted data 
segments with document identifiers associated with locations of documents, 
extracting locations of documents corresponding to the extracted data 
segments based on the document identifiers, and including locations of the 
documents in the list of documents. 
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10 



35. A system for initiating a search at a first network site 
for user-specified data in a remote database at a second network site and 
for conducting the search at a third network site, the system comprising: 

a first network site which executes process steps so as (i) to 
receive a provider identifier associated with the database from the second 
network site, (ii) to input the user-specified data, and (iii) to output the 
user-specified data and the provider identifier from the first network site to 
the third network site; and 

a third network site which executes process steps so as to search 
for the user-specified data in a database at the third network site using the 
provider identifier, wherein the database at the third network site includes data 
that corresponds to data stored in the remote database at the second network 
site. 



^5 36, A system according to Claim 35, wherein the third 

network site comprises: 

memory means which stores the process steps and indices of 
data in plural databases, the plural databases including the remote database 
at the second network site, and the data in each index being indexed by 
20 document, location in the database, and provider identifier; and 

a processor which executes the process steps so as to search 
for the user-specified data by (i) retrieving, from memory, an index of data 
for a database based on the provider identifier, and (ii) identifying 
documents in the index that contain the user-specified data. 



25 



30 



37. A system according to Claim 36, wherein the database 
at the third network site is published on the Internet. 

38. An apparatus for configuring a computerized searching 
system so that the searching system can be used to search a database 
comprised of one or more documents, the apparatus comprising: 

a memory which stores computer-executable process steps; 

and 

-28- 



BNSDOCID: ^ wo 000e572A1 I > 



wo 00/08572 



PCT/US99/17359 



a processor which executes the process steps so as (i) to 
input information identifying the database, (ii) to assign a provider 
identifier to the database, (iii) to search through the database using the 
information input in the inputting step in order to identify locations of 
documents in the database, (iv) to store, in memory, the locations of the 
documents in the database, together with the provider identifier, (v) to 
index the documents in the database, (vi) to store, in memory, an index of 
the documents generated in the indexing step, together with the provider 
identifier, (vii) to store, in memory, data corresponding to data in the 
database, together with the provider identifier, and (viii) to output the 
provider identifier to the database. 



39, An apparatus according to Claim 38, wherein the 
processor further executes process steps so as (i) to receive user-specified 
data and the provider identifier for the database, (ii) to retrieve, from 
memory, the index of documents for the database based on the provider 
identifier, and (iii) to identify documents in the index that contain the user- 
specified data. 



40. An apparatus according to Claim 39, wherein the data 
corresponding to data in the database is stored in memory in compressed 
format; and 

wherein the processor further executes process steps so as (i) 
to search for the user-specified data in the documents identified in the 
identifying step, (ii) to generate pointers into the compressed data based on 
the search conducted in the searching step, (iii) to select compressed data 
ft-om memory that corresponds to the provider identifier, (iv) to 
decompress segments of the compressed data selected in the selecting step 
that correspond to the pointers generated in the generating step, and (v) to 
output the decompressed segments of data, together with a list of 
documents containing the decompressed segments. 



41. Computer-executable process steps stored on a 
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computer- readable medium, the computer-executable process steps to 
identify which documents in a database contain user-specified data, the 
computer-executable process steps comprising: 

code to store, in memory, indices of data in plural databases, 
5 the data being indexed by document, location in the document, and 

provider identifier; 

code to receive the user-specified data and a provider 
identifier which corresponds to one of the plural databases; 

code to retrieve, from memory, an index of data for a 
10 database that corresponds to the provider identifier; and 

code to identify documents in the retrieved index that contain 
the user-specified data. 

42. Computer-executable process steps according to Claim 
15 41, further comprising code to output identities of the documents identified 

by the identifying code. 

43. Computer-executable process steps according to Claim 
41, wherein the database is located on the Internet. 



20 



44. Computer-executable process steps according to Claim 
41, further comprising: 

code to search for locations of the user-specified data in the 
documents identified by the identifying code; 
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code to store, in memory, compressed data which 
corresponds to data stored in each of the plural databases; 

code to generate pointers into the compressed data based on 
the search conducted by the searching code; 
5 code to select compressed data from memory that 

corresponds to the provider identifier; 

code to decompress segments of the compressed data selected 
by the selecting code that correspond to the pointers generated by the 
generating code; and 
10 code to output the decompressed data segments. 

45. Computer-executable process steps according to Claim 
44, wherein the pointers identify the user-specified data in the compressed 
data, together with a predetermined amount of data adjacent to the user- 
15 specified data. 



20 



46. Computer-executable process steps according to Claim 
45, wherein the amount and/or location of the data adjacent to the user- 
specified data are variable. 



47. Computer-executable process steps according to Claim 
44, wherein each segment of decompressed data includes a document 
identifier which identifies a document containing the segment; and 

wherein the computer-executable process steps further 

25 comprise: 

code to store, in memory, locations of documents in the 
plural databases, together with corresponding document identifiers; 

code to extract, from memory, locations of documents which 
contain the segments of decompressed data based on the document 
30 identifiers associated with the segments; and 
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code to output the locations of the documents which contain 
the segments of decompressed data. 

48. Computer-executable process steps according to Claim 
46, wherein the locations of the documents comprise uniform resource 
locators. 

49. Computer-executable process steps stored on a 
computer-readable medium, the computer-executable process steps to 
retrieve a list of documents in a database which include user-specified data, 
and to retrieve one or more data segments from each document on the list, 
the computer-executable process steps comprising: 

code to store an index of documents from the database, the 
index including pointers corresponding to data in the database, where the 
pointers define data segments having a predetermined size; 

code to identify one or more documents in the database that 
contain the user-specified data based on the index; 

code to create a list of one or more pointers corresponding to 
each occurrence of the user-specified data in the documents identified by 
the identifying code; 

code to extract each data segment in the database that 
contains the user-specified data based on the list of pointers created by the 
creating code; and 

code to output a list of the documents in the database that 
contain the user-specified data, together with the extracted data segments. 

50. Computer-executable process steps according to Claim 
49, wherein the database is located on the Internet. ^ 
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51. Computer-executable process steps according to Claim 
50, wherein the list of documents comprises one or more uniform resource 
locators and/or uniform resource indicators. 

52. Computer-executable process steps according to Claim 
49, wherein the identifying code comprises code to perform a boolean 
search of the index for documents which contain the user-specified data. 

53. Computer-executable process steps according to Claim 
49, further comprising code to store, in memory, compressed data which 
corresponds to data stored in the dataljase; 

wherein the extracting code comprises code to extract each 
data segment from the compressed data and to decompress each extracted 
data segment prior to output by the outputting code. 

54. Computer-executable process steps according to Claim 
53, wherein each data segment includes a document identifier which 
identifies a document that includes the data segment; 

wherein the computer-executable process steps further 
comprise code to store, in memory, locations of documents in the database, 
together with associated document identifiers for the documents; and 

wherein the outputting code comprises code to generate the 
list of documents by comparing document identifiers from extracted data 
segments with document identifiers associated with locations of documents, 
extracting locations of documents corresponding to the extracted data 
segments based on the document identifiers, and including locations of the 
documents in the list of documents. 
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55, Computer-executable process steps stored on a computer- 
readable medium, the computer-executable process steps to initiate a search 
at a first network site for user-specified data in a remote database at a 
second network site and to conduct the search at a third network site, the 
5 computer-executable process steps comprising: 

code to receive, at the first network site, a provider identifier 
associated with the database from the second network site; 

code to input the user-specified data at the first network site; 

code to output the user-specified data and the provider 
10 identifier from the first network site to the third network site; and 

code to search for the user-specified data in a database at the 
third network site using the provider identifier, where the database at the 
third network site includes data that corresponds to data stored in the 
remote database at the second network site. 

15 

56. Computer-executable process steps according to Claim 

55, wherein the third network site includes a memory which stores indices 
of data in plural databases, the plural databases including the remote 
database at the second network site, and the data in each index being 

20 indexed by document, location in the database, and provider idenfifier; and 

wherein the searching code comprises: 
code to retrieve, from memory, an index of data for a 
database that corresponds to the provider identifier; and 

code to identify documents in the index that contain the user- 

25 specified data. 

57. Computer-executable process steps according to Claim 

56, wherein the database at the third network site is published on the 
Internet. 
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58. Computer-executable process steps stored on a computer- 
readable medium, the computer-executable process steps to configure a 
computerized searching system so that the searching system can be used to 
search a database comprised of one or more documents, the computer- 
5 executable process steps comprising: 

code to input information identifying the database; 

code to assign a provider identifier to the database; 

code to search through the database using the information 
input by the inputting code in order to identify locations of documents in 
10 the database; 

code to store, in memory, the locations of the documents in 
the database, together with the provider identifier; 

code to index the documents in the database; 
code to store, in memory, an index of the documents 
15 generated in the indexing step, together with the provider identifier; 

code to store, in memory, data corresponding to data in the 
database, together with the provider identifier; and 

code to output the provider identifier to the database. 

•0 59. Computer-executable process steps according to Claim 

58, further comprising: 

code to receive user-specified data and the provider identifier 
for the database; 

code to retrieve, from memory, the index of documents for 
5 the database based on the provider identifier; and 

code to identify documents in the index that contain the user- 
specified data. 
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60. Computer-executable process steps according to Claim 
59, wherein the data corresponding to data in the database is stored in 
memory in compressed format; and 

wherein the computer-executable process steps further 

5 comprise: 

code to search for the user-specified data in the documents 
identified by the identifying code; 

code to generate pointers into the compressed data based on 
the search conducted by the searching code; 
0 code to select compressed data from memory that 

corresponds to the provider identifier; - 

code to decompress segments of the compressed data selected 
by the selecting code that correspond to the pointers generated by the 
generating code; and 
5 code to output the decompressed segments of compressed 

data, together with a list of documents containing the decompressed 
segments. 
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